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ABSTRACO: 

A Monte Carlo simulation vas used to ascertain the 
degree of inflation that can occur in regression estimates vhen 
samples contain randomly occurring instances of a pattern among 
correlations called cooperative suppression* INep thousand samples of 
scores on three variables were randomly drawn from a population i^ 
whicli the correlations among the variables were prespecified such 
that cooperative suppression did not exist* Cooperative suppression 
occurred in . nearly ^8% of the samples but the incidence of regression 
coefficients that fere grossly discrepant from the population 
parameters^was rare* The inplicTBtions for multiple linear regression 
and a method of causal investigation called path analysis are, 
discussed. (Author/CTfi) i 
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A MONTE CARLO INVESTIGATION OF . • 

SPURIQUSLY. INFLATED REGRESSION ESTIMATES _ ' . . ' ' 

' Thomas. D, Jovick 

* ' • ' Introduction . ' . . 

' 'Pie fields of educational research iand evaluation have, recently 'cfiprressed 
interest in tjie method of path analysis a^ a means- of determining causal relar- 
tionships among variables in survey data. Like any other statistical procedure, 
it possesses properties* which have not yet been fully as'c^ert:a:iiw4»"^^^^ usin^ 
multiple linear regression as its basic analytical tool, patri aUalysis relies 
heavily on .sample regression estimates to^ reflect the validity of, an^ make 
causal interpretations about, a hypothesized model oF causal relationships . 
among variables. Little has been documented -on the likelihood of obtaining 
regression estimates spuriously inflated beyond their popMation counterparts. 
Such inflations may occur when the pattern of positive and negative signs of 
the correlations among the variables produces a statistical phenomenon called 
c oop er at i ve suppre s s i on (Cohen and Cohen, 1975) , 

This study provides documentation bearing on spurious instances of ^ 
cooperative suppression and the resulting inflation of regre,ssion coefficients 
in three- variable regression equations, and discusses implicati^s of the 
results for path analysis. It attempts to provide such data by addressing 
four concerns: s 

1, How often will tile pattern among ^co^relationa that is charaCTSristic 
of cooperative suppression tend to occur in a sample just by chance? 
Most simply the answer involves a record of the proportion of time 
one expects to find samples in which v^^ is negative and r^^ and t^^ 



In those instances where <he pattern does occur; ' ' 

2, How often will the negative correlation between independent variables 
be significant, allowing one to infer the presence of genuine cooper- 
ative suppression? , > 

5. How much overestimation can be expected in the estimates When the : 
pattern comprising cooperative suppression appjears? Thi& would* 
tell how much the inflated estimates in general tenjd- to deviate 
from their respectij)e population values. ' ^ * 

4. What size inflations -generally occur by chance and hov|much of a 
problem are they? Do researchers have much reason to be conceriaed 
-about those estimates being grossly inflated and therefore gi^ssly 
misleading? 

Although path analysis may employ equation;s involving several- independent 
variables, this paper will Umit itself to those with Inly two independent 
variables. The immediate tie to path models, then, is with those using a 
series of three-variable relationships. ^ 

As an attempt to inform these concerns, this study focused on the random 
occurrence of the cooperative suppression pattern among correlations in a two- 
independent-variable regression equation and the ensuing ^influence on the 
regression estimates. The analysis involved a two-stage process. First, a 
Monte Carlo simulation- program generated 10,000 samples of 100 "scores"^ for 
each variable randomly draxm from a population in which the correlations among 
theVjihree variables had been pre-s|)ecif ied such that cooperative suppression 
did not exist. For each sample the program calculated correlations and regression 



estimates and recorded their signs and if they were significant for alpha = 
,05 fone-tailed) . The second stage involved inspecting the data generated 

f 

by the Monte Carlo program. This was done using, prpgrams in the Statistical ' 

Package for the Social Sciences (SPSS) to examne in mo^'e detail particular 
Samples in which cooperative suppression effects occur.. j 



, Cooperative Suppression and Its effects on Regression, Estimates 

Path auialy^is has been used quite extensively in cross-sectional sociol- 
ogical research as a means of testing theories. Although it cannot prove 
causality, it dies purport to determine whether a pattern of intercorrelations 
among a set of variables can be meaningfully explateed by a particular /tteo- 
ret'ical formuiation about ordered ^relationships . The causal validity of the 
model itself rests primarily upon substantive, empirical and conceptual co'n- 
siderations (Kerlinger and Pedhazur, 1973; Ainick and IValberg^ 1975; Namboodiri, 
et al, 1975). ' ^ ^ ' ^ 

The basic inferential tool crucial to the method is multiple^ linear, 
regression which allows one to examine the magnitudes and di^ct;ions of effects 
and their statistical significance while controlling for mutual inf^uencQS 
among independent variables. The hypothesized causal model itself can be repre 
sented by a set of multiple linear regression equations.* 



Because variables in the behavioral sciences are often expressed in 
arbitrary \scales, not much substantive information about a pa\h analytic 
model i.s conveyed by non-standardized regression weights, which specify that 
Na 1.0-point change in the independent variables causes b points change in the 
/dependent variable. This is because the difvferent scale ranges of the indepen- 
dent variables obscure the importance of different variables relative to one 
another when the nonstandardized b-weights are used, i 

For this reason the standardized regression weights, p's or betas, are 
used as path coefficients tcj represent the direct effect of independent on 
dependent variables. Each coefficient estimates the amount of change in 
standard deviation units of the dependent variable that is produced by a I- 
standard deviation change in the respective independent variable (Amick and 
Walberg, 1975), ' , 

i) 



-4- , ^ 

2 

It is crucial to reali^ze that although the R and 'betas for any equation 



in the model are central to the analysis, their values rely on the correla- 
tions among the variables. The way in which correlations of differeqt . . 
magnitudes and signs form different patterns (^f^flflLationship's will dictate the 
character of the regression esfimate^r — ^variQty of patterns of correlations 
among dependent and independent \priables can exist, each of which has impllr- 
cations for the rragnitude of the regression estiiuates and their' substantive 
interpretation. ^ ' 

^^^ohen and Cohen (1975) describe the pattern characteristic of cooperative 
suppression as one of the most attract^ive for researchers to find,* Its appeal 

4 

is in the characteristic way the estimates get enhanced beyond what one would' 

expect from the correlations between Y pind each independent variable alone. 

For the two independent variable case, cooperative suppression comprises cases 

in which the independent variables correlate' positively with Y but negatively 
witri each other. 

.■■■- / - -* — ■> ■ ■ 



* 



J- Cohen and Cohen (1975) describe three "attractive^' - 

^ , . patterns whose appeal is in the characteristic way the regression 
es,timates get enhanced in magnitude beyond what th6 correlations between 
Y an4 each indepenijent variable would lead one to expect. ^ The pat^^tems 
Come under the general label of suppression arid are conveniently identified 
when each beta weight fills outside the range 0 to ry^^ - 

Classical suppression occurs when Vy^^ = 0,. r^^ >0, and r ^ = 0* 
Although X2 is unrelated to Y, using it in the regres^ipn equation increases 
R beyond Its value had only been used. The, absolute value of the bet^s 
are larger than the simple correlations wath Y; in particular, although X / 
is uncorrelated with Y, its beta weight does not equal zero. / 

iDf net suppression, although all correlations are positive, X2 sup- 
presse^-a portion of the variance in X that is irrelevant to C^correlated 
with)l Y and thereby^ increases R beyona what it weuld be if only X^. were used 
in thfe equation. The beta weights will fall outside the rang.e 0 to with 
the adyiition .that the beta fQ,r the suppressor variable will ■t)e opposite in - 
sign of its T^^ . 



I 



Examples of Cooperative Suppression ' ' - 

Cohen and Cohen (1975). describe an instance arising in personnel, sfelec-^ 
tion, A Director of vParsoiinel, as 'an attempt to establish a meai:is xif selecting 



sa|es persons from Amon-g a pool of i*fipplicants, draws a sample of c^nrrent sales- 
persons and QbtainS ratings of thei:^ overall success in sales performance, 
lnter\dew data^-suggest that social aggressiveness and habits and skills dn 

record keeping each constitute a major determinant bf ^ales, successes . 

* < ^ ' '*'"'•( 

Measures of these two .variables art administered to the sample, Results , 

reveal that' the correlation between social agg;ressiveneW' C^j^) alrd sales 

\ ^ 

" success, (Y) is- .29/ between record keeping (X^) atild sales success (Y) is 

r' 

:24, and = ""-^O Indicatitig that those high on social aggressi)^nerss tend 
to be low on record keeping s);ills. » 

In this example, high social aggressiveness ^ids j^to go along with 
high sales success but also wi|:h low record keeping skills, which itself is 
incompatible with high sales success. \^en a person is high o^ social 
aggressiveness, he also tends to register low record keeping, skills; that low 
standing cbntaminates or suppresses the true relationship (correlation) 
between social aggressiveness arid 'sales success. In order to determine the 
uniCiue relationships to the dependent variables one sPust control for the 
suppressing influence of the low standing on the other independent variable, 
Whefi that is done, the relationship between X^^ and Y increases beyond the 
size of their zero order correlation. ^ 

Tlie argument about cooperative suppression applies also to X^. That 
is, the relations'hip between and Y is also suppressed by the negative 
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correlation between X. and X and simiiarly becomes erihanced when controlling 
Randomly Occurring Cobperati've Suppression: Hazards to Oausal Inference ^ \ 



* .* If cooperative supprefision docs not e^cist in th^^ population, a chance, 
still exists t;hat it w'ill^be found in safiiples^. Out of a very large numbVo; , 
independent samples (^rawn from the same population, a number will exhibit 
cooperative suppression by chance alone, A certain proportion of those 
instances will contain significant inverse correlations between t*he indepen-- 
dent. variables, and lead one to infer that cooperative suppression irideed 

• does exist in the population and that the inflated regression estimates reflect 

» . ■ 

the true relationships. The remainder will contain nonsignificant negative 

r^^'^^^^ suggest any such inference. In either case, the negative corre- 
lations will still occur and act to inflate the estimates even -though the 
parameter for tl\e correlations is zero, ' / 

The potential for hazardous inferences in such samples is obvious • 
The enhanced values of the estimates give the investigator the false impression 
his independent variables explain a good deal of the variance in the depen- 
dent variable and are important and major causal influences because of the 
large betas they exhibit, ' 

The prospects for path analysis a.re^, d:j-sturbing for a variety of reasons , 
Those using the me tliocl^ traditional ly appear indifferent to, the magnitudes of 
■the total proportions; of variance explained by tlve independent vaofiables in . 



eaqh regression equation. However, wKat look-s like large path coefficients 
may in total explain little of the variation in the dependent variable, . - 

Furthermore, tjbie analysts tend to accept at 'face v6lue the relativo magni- 
tudes of the rf^h coefficients as a basis for assessing the causal importance [ 
or unimportance of direct^connections .between variables. More disturbing 1$ 
t-he fact that some equations in the model may contain enhanced estimates 
vKereas others do not. As a conseq^upnce, ^j^e paisf of the model may contain 
inflated estimates which .appear large and substantial in contrast to uninHated 
^estimates in another part of the model. The researcher's interpretations will 
then reveal "important" relationships among certain variables whose estimates 
occurred merely as a'IKinction .of chance fluctuations in sign and magnitude 
in the correlation, ^ • ' ' , 

Cooperative Suppression: Theoretical and Mathematical Considerations 

This s^tion wi 11 ^ demonstrate how the, par tialling process in multiple 

2 ' 
linear regression enhances the R and beta weights to their proper magni-^ 

tude-s. In order to provide a more encompassing perspective on the problem^ 

the discussion will deal/with the instances for which the independent variables 

are uncorrelated, then tkose for which they .are positively correlated, and 

.finally to the focus of the study, those for which they are negatively 

correlated. . ' ^ 

fffect- of Cooperative Suppression o n Total Prop ortion of Variance Explained 
. — J ' ^ ^ 

Vmen/a dependent variable is recress^id onto two uncorrelated indepen- 
dent variables, X, and X^,^ the total prbportdon of Y variance explained is 



simply the sum of the squared correlations of each Xj^ with V or v 

. ^' . ' * ' 2.2 2 m 

R r^j * r^, ■ (1) ' 

I • ' ' " . 

* VBecausp X and X are uncorrolated, each squared correlation teflects • 

f ■ , r 1 ^ 

the uiique contribution of the variance in the particular X tc the Y variance 
(Ker/fngcr and Pedhazur, 1973; Ainick and Walberg, 1975), ^/"J^^ 
Normally ohj finds correlations greater than zero" b^t^een the indepen- ' 
dent '^variables, in which ais^o formula (1) no longer applies, ^Vhen all 
correlations are positive, r^^^ and r'!^2 longer reflect unique contributions 
of Xj and to the Y variance. Rather , >part if the proportion of Y variance 
expl>\ned by one also involves part, of th^ proportion explained by the oth^V. 



By not partial lilng out this rcdund^t variance, ^one risks inferring that 
each isylexplaining a greater proportion of Y than it really is, 

Essentially, the partialling process is one of extracting from one of 

' the independent variables all information in it contributed by the other 
independ^int variable; then, the proportion explained by one independent 
<?ariable plus the proportion explained by the .other after the first has been * 
partialled^rom it combine to give the total proportion^f Y variance 
explained. Inspection of the formula for the total proportion of variance 
demonstrates how this happens. ^ 

« \Vhen a correlation exists between the independent variables'* formula C2) 
or (3), whid^^aro equivalent to each other, must be used (Keflinger and 

■ Pedhazur, 1973), although eacli reduces ,to formula CU ^N-hen r^^ ^' -Formula 



.where r is. the squared semi-partial correlation between X2 ^nd Y controlling 

for. X^j* It gives tfre proportion of variance added by explaining Y after 

2 

taking into accoujit that amount contributed by^ X^^ . Alternately, R is also 

given by the following formula: . ) . " 

^ ' - ^ 2 2-' 2 ^ 

where, r is the squared semi -pa'rtial , correlation of X^^ with Y controlling 
fbr^X^. Ttie formulas for the semi-partials themseTyes are, the key to how 
and where this shared variance gets extracted, 

^YC2.1) " ^Y2 " ^YL^12 • ^ - ^ 



12 
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In formula (4) for example, when r^^ is positive, ^Yl"'^i2 also posi- 
live and gets subtracted from r^^ t:hus taking into account the sharing of 
variance in Y due to the correlation between the independent vai'iables 
CKerlinger and Pedhazur, 1973), 

With ciooperative suppression'^ the presence of a negative correlation 

lends a peculiar twist to this /ration of extracting variance shared between 

2 2 

independent variables, IVhen r^^^ negative, r^j^ and Ty^^ ^gair^no longer 
reflect unique proportions of Y variance explained by and X2 respectively 
Tne negative correlation, however, indicates that the independent variables 



arQ, mutually suppressing some of the variance in Y each explains by itself, 

l(ather than extracting shared variance, the partialling process adds this 

"hidden" part of the unique variance back into each independent variable's 

^zero-order relationship with the dependent variable, 

Rl/ferring to formula C^X again for illustrative purposes, we see that 

the influence of th« ;^.egative correlation talces place in the term ryi'^l^ ' 

the numerator. V.T^n r^,^ is negative, ryi'^lZ also negative but its absolute 

value gets added to ^^2* iri effect increasing the correlation between and 

Y after controlling for X. . 

i * 

Table 1 presents some fictitious data to il lustrate the phenomenon 
characteristic of cooperative suppression. The left half presents the 
estimates when no correlation existie between the independent variables and 
serves as a comparison for what happens to them as the correlation becomes 
increasingly negative. 

2 

The table shows thal^ in all instances, the R increases as a function 
of taking into account the inverse relationship in the independent variables. 
As the correlation becomes more negative, obviously a greater portion of the 

unique relationship between independent and dependent variables is suppressed; 

2 

when that inverse relationship is taken ''into account, the R increases more 
and more above what would be expected if no correlation existed between the 
independent variables. For example, when r^^^ and r^^^ equal ,2, one doesn't 
expect the total proportion of variance to be greater than ,08, Yet, as a 

function of a negative r.^ the R increases beyond this by a minute amount 

> «' - J- ^ > 

to .088 when r^^^ = --1 and to a more substantial size of »2 when r^^ = -,6, 
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TABLE i 



FICTITIOUS "DATA ILLUSTRATING INFUTIONARY CHABACTERISTICS IN COOPERATIVE SUPPRESSIONi 
INFLATION IN REGRESSION ESTIMATES DUE'TO NEGATIVE r,. COMPARED TO r,. = 0 

HOLDING^ r^^ AND r^^ CONSTANT ' . 



^12 = Zero • ' ^ 


^12 Negative ' V 


■ , : 

n2 g 'g Standard Error 
ni ^Y2' . Y1.2 ^Y2.1 of Betas > 

I:^ ^ 


/12 


n2 g n Standard Error 
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The phenomenon apparently occurs whether or not r^j^ and r^^ are equal, For^ 
example, when r^^^ equals ,2 and r^^^ equals ,4, one typically expects to find 
an not larger than ,2; but as r^^ becomes increasingly negative C-.l to ^ 
-.4), the R deviates increasingly above ,2 from ,22 to ,31, i3hiselli C1964, 
'p, 311) nqted the same general trend as part o£ a discussion about prediction 
studies. 



Effect of Cooperative Suppression on the Beta Weights (Path Coefficients) 

2 

Table 1 suggests that, like R , the beta weights also increase as a 
fuikction of partialling oiit the irrelevant portion of variance in the inde- ^ 



pendent variables. The increase appears to be more dramatic than it is for 
2 

R , For example, in the absence of suppression effects when r^^ and r^^^ equal 
.3, one would normally expfect eacli beta to be no larger than ,3; but, as r^^ 
become^ increasingly negative from -.1 to -.6, each beta deviates above 
expectation frorij .33 to .75. Similarly, wfien r^^ = .2 and r^^ = ,4, one 
normally expects py^^ 2 ^° ^° larger than .2 and 3y2 ^° larger than 

.4; yet, as r^^ increases in negativity from -.1 to -.4, ^ increases fro 
,24 to .45 and ^3^2 ^ increases from .42 to .57, The formulas for the beta 



m 



weights provide some insight as to why thi^s happens. 
Fo:^ the two independent variable case. 



-4 



^Yl,2 " ^Yl ^ ^Y2^12 
1 ^ r^2 



and 



/ 
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Note their similarity with the formulas .for the semi-partial corfelation 
coefficients;, particularly, the numerators are identical to those for the 
corresponding semi -partial • * • ^ 

When r^^ equals zero, ^-g;^.^ ^ reduces to r^^ and 3y2,1 .^^^^^^^ ^^^12, 
Normally, with t^^ positive, the betas will be less than their respective 
correlations because the amount of variance shared between the independent 
variables gets partialled out of each. This is algebraicall^y manifested in 

the subtraction of ^^2^12 ^^^^ ^Yl ^Y1^12 ^^°^V^y2 ^'^^ * 

cooperative suppression the absolute value of the quantities ^yi^12 
r^^r^^ added in the numerator, in effect enhancing the magnitude of the, 
betas beyond r^^ and r^^, respectively. Thereby, the true magnitudes of the 
relatio^i^ips between independent and dependent ^variables are brought to the 
surface. 

Procedure 

The initial step was to generate, for three variables CX^^, and Y) 
a sample of scores which were randomly selected from a population in which 
the tl>ree correlations, r^2 ^yi ^Y2' °^ ^ pre-specified magnitude, 
Kaiser and Dickman present a method for randomly generating a sample 

correlation matrix from a given population matrix, 
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Invoking their procedure for 10,000 samples of size n » 100 and 

using the same population matrix of correlations, a ^tonte Carlo computer 

program was developed to generate three san^ling distributions, one for each 

correlation. The population parameters for the correlations**, their 

2 

respective beta weights and tRe total proportion of variance (R ) are: 

parameter value 
, .0000 

J - ^ ,2000 



( 



r^^ , .3500 . 

Py^ ^ .2000 

3vo .3500 

.1625 

For each sample, regression estimates were calculated ai^d^ whether or 
not each value was positive and statistically significant was checked., The 
ma^itude and direction of the differences g^^ - r^^ and ^y2 ^^^^ 

also recorded to demonstrate the discrepancies 'between the betas and their 
respective correlations in e^ch sample. All this information was stored 
on tape for later access. 



i 



This stud/ limited the sample size to, 100 to keep it in the realm af ; 
. sample sizes accessible to educational studies yet sill in the area normally 
used, iri path analytic studies. In view of the difficulties in obtaining 
' large samples in most educational research, an N of 100 seemed an appropriate 



size. - 



r 

For population values for r^^ t^^ and Appendix A, 



\ 
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\Tlie Monte Carlo method therefore provided the basic data from which 
to select out and analyze the incidence of cooperative suppression' effects , 
Canned programs in the Statistical Package for the Social Sciences (SPSS) ^ 
were then used to examine this data in more depth, 

'ResuTt^ 

Characteristieg of the Overall Distribution, ' r 

Across all samples, the averaged for each correlation and regression - 
estimate nearly equalled the population parameters and the standard deviations 
were quite small- Kurtosis and skewness deviated little from* zero indicating 
close approximations in form to the normal distribution. These, data are 
presented in Table 2. - . 

TABLE 2 

NlEANS, STANDARD DEVIATIONS, KURTOSIS, SKEWNESS, RANGES, MINIMA 
AND MAXIMA FOR CORRELATIONS AND REGRESSION ESTIMATES 
ACROSS ALL SAMPLES (N ' = 10, 000)^ 



















K '"12 




^Y2 


^Yl 


^Y2. 
















Mean 




.002 


.20' 


.347 


.20 


- .344 


.176 


Standard 


Deviation 


.099 


.097 


.087 


.091 


.086 


.066 


Kurtosis 


^ 


-.045 


-.103 


-.ou 


-.092 


.033 


.052 


Skewness 




-.017 


-.111 


-.193 


-.099 


-,172 


.357 


Range 




.759 


.748 


.629 


.728 


,626 


.453 


Minimum 




-.383 


-.161 


.014 


-.172 


.009 


.004 


Maximum 


Js 


.376 


.587 


.643 


.556 


.635 


.457 
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Table 3 presents tl\e distributions in terms of the percentage of cases 
falling into + 1, 2, and 3 standard deviation intervals around the parameter. 
The midpoint for each segmented line has been set equal to the appropriate 
population parameter and standard deviation units marked off on each side, 
The parameter values appear in parentheses adjacent to thfe name of each 
estimate. Consult Table 2 for the appropriate value of the standard deviation 
for the correlations and regression estimates. Entries are the percentage 
of samples ^falling in the specified interval, e,g,, 33,8% of the sample r^^*^ 
were within 1 standard deviation below the parameter and 13,4% were between 
-1 and -2 standard deviation, 

' V\lien V-orrelations are sampled frop^a population in which the relation- 
ship is pther than zero, the sampling distribution tends to be skewed, as 

\ suggested by Table 3, As a further check on the performanc^ of the Monte 
Carlo program, distributional properties for ry^ and r^^ ^^^^ compared with ^ 
those for the normal distribution with mean equal zero and standard deviation 

coequal 1, that is NC0,1). In order to express such distributions in terms of, 
a' normal distribution, first it was necessary to employ the Fisher r to Z 
transformation of the correlations. The transformation is: 

Z = 1/2 log p ^xy 

VI - V, 

The res,ulting sampling distribtuion will be approximately normal with a 
standard devi 



aation of jl 



N-3 

The distributions of- the transformed r^^ and r^^ values for the 10,000 samples 
closely approximated the nonnal ' distribution. 



20 
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TABLE 3 



FEBCENTAGE OF ALL SAMPLES FALLING INTO - 1, 2 AND 3 STANDABD 
DEVIATION INTERVALS ABOUND THE PARAMETER FOR CORRELATIONS aT 
REGRESSION ESTIIIATES (N = 10»^)00) 



_2 

3D 



.-1^ ■■ Parameter +1 
c-i Value . SD 



(0.0) 



(.20) 



2.1 13. ij- ' 33.8 

I » ' ' 



2.^ I 13.5 ) 33.3 



(.35) ^ 2.6 ^ 13.8 ^ 32.9 



33.3 



^ (.35) . 2.6 U 



33 



R^^' (.1625) 



,3 ^ 10.2 ^ 33.5 



+2 
SD 

-4- 



+3' 
SD 



-1 



, IkJ^ 2.0 

J -I' 



3^.5 



1-^.3 . 1^8 
-__J 1 



35.^ ' , 13.5 . 1.^ . 
* — • 1 



3^.3 , 1^.3 



35 



2.0 



13.3 , -1.6 . 

1 H 



3^.€ ^ 16.8 ^ 3.8 ^ 
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'Incide(iGe pf Cooperative Suppression 

Out of the 10,000 samples, the pattern chara<^teristic of cooperative 
siqppression occurred in 48% or |l^763 samples* Of these 91,2% C4,343) had 
a nonsignificant r-^^^' which is 43,4% of all 10,000;^tHus, aboijt 43 out of 
100 cases had the pattern characteristic of cooperative suppressi^on but 
would not allow one to infer it exists in the population. The remaining 
•8.8% (420) or 4.2% of the entire 10,000 cases had a significant r-,^ for a 
one-tailed test with ^Ipha = »05; that isS, about 4 out of 100 samples did/ 
allow one to ^fer the presence of cooperative suppression, even though it 
did not actually exist in the papulation 



Discrepancies Between Betas and Correlations / 

IVhen the pattern of cooperative -suppression has been found, regardless 
of whether or not r^2 significant, one expects the beta weights to be 
larger than their respective correlations, and this proved to be true for the 
present data. The information given below illustrates the general sizes of 

T discrepancies, even though none actually existed in the population. 
\ In each sample generated during t;he Monte Carlo routine, the correla- 
tions, r^^ and rY2> were subtracted from tlieir corresponding beta weights, 

and L-^^- Table 4 presents the means, standard deviations, ranges, minima 
and maxima for the resulting differences in estimates for patterns of cooper- 
ative suppression only. The mean differences depict the amount of inflation 
that tends to occur as a function of the randomly occurring negative correla- 
tions between the independent variables. In general, the mean differences 
across all patterns of cooperative suppression are sliglit. ^'"v 



The range of differences in regression' estimates suggests .the possi- 
bility that an occasional case may >(^eld^ marked discrepancies, but their 
standard deviations suggest that the occurrence of maximum differences 
attained in this study is rare (5 to 7 standard deviations above the mean- 
dif ference]j|^ 

TABLE 4 , - " ' 

ALL. SAMPLES WITH PATTERN OF COOPER^MIVE SOTPRESSION: ' MEANS, STANDARD 
DEVIATIONS, RANGES, MINIMA AND >1AXIMA FOR DIFFERENCES 'bETIVEEN 
CORRELATIONS AND RESPECTIVE BETAS (N = 4,763) 



Yl '^Yl 



Mean 

Standard Deviation 
Range 
Minimum 
i Maximum 



.027 

42\ 



.lf[6 
.000 
.146 



^Y2~^Y2 
7016 

.015 

.120 

.000 

.120 



^/Table 5. presents the means, standard deviations, ' ranges, minima and 
maxima for the differences in estimates for patterns of cooperative suppres 
sion with a significant Although their discrepancies were larger than 

those for cases with a nonsignificant r.^, these instances comprised only 
qjbout 9% of allf^ cooperative suppression patterns. , In light of the small 
standard deviations, tWi's information .makes it clear that the odds we;re low 
for obtaining a sample with a cooperative suppression pattern in which the 
r Was significant and in which the estimates deviated markedly from the 
parameter. ' 



TABLE S 



SAMt>LES WITH PATTERN OF COOPERATIVE, SUPPRESSION AND SIGNIFICANT 
r^^: MEANS, STANDARD HEVIATIONS, RANGES, MINIMA AND 
MAXIMA FOR qiFFERENCES BETWEEN .CORRELATIONS 
AND RESPECTIVE BETAS' CN = 420) 



Mean . ; .'072- .044 

Standard Deviation .022 .018 

Range^" .124 .113 

Minimum ,022. ,007 



• Maximum ,J:46 .120 

Magnitudes of the Regression Estiniates 

In conjunction with the expectation th^t the betas would be larger than 

V ■ • 

the correlations, the regression estimates in patterns of cooperative suppr^s- 

sion were expected to overestimate their parameters. That is, because of the 

enhancing effect of the negative ^-^2' mean f6r the g^j^'^* ^Y2*^ 

R 's were expected to .be larger ^han their populatiij'n values. This was found 

not to be true in the data of this study,* The sets of information below 

elucidate different aspects' of the data concerning the sizes of the sample 

estimates; 

Table 6 presents tlw means, standard deviations'\^ ranges, minima: and 
maxima for the regression jestimates fpr ^o 4, 763Vpatterns of cooperative 
suppression. The means for all t'hree estimates appear to be quite accurate 
approximations to tlie paran\etgrs'. ' • , 
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TABLE 6 

« 

ALL SAMPLES WITH PATTERN OF COOPERATIVE SUPPRESSION: 
MEANS, STANDARD DEVIATIONS, RANGES^ MINIMA" 
AND MAXIMA FOR REGRESSION ESTIMATES 
CN « 4,763) 



Mean, .•207- ^348 , ' .167 

Standard Deviation .084 .087 , .065 

Range • \ .490 .588 .412 

Minimum ■ .007 .047 .015 

Haximum .497 ,635 »427 



Although the minimum and maximum values attained for all correlations 
and estimates deviate markedly from the population parameters, the incidence 

of such extreme values was rare. Over all 4,763 instances of cooperative* 

2 

suppression, about 70% of the sample By'^'s, ^yz'^* ^ ^^'^'^ within +^ 1 

« 

standard deviation of their parameters and about 95-98% fell within 2 stan 
dard deviations of their parameters, 4 ^ 

IVhen cast in light of all 10,000 samples, these percentages become 
reduced by about half » Generally^ about 32-34% are within +^ 1 standard 
deviation of their parameters and 45-47% are within ^ 2 standai'd deviations 
of their parameters. 

Table 7 presents the meiuis, standard deviations, ranges, minima and 
pjaxima for the 420. cases of cooperative suppression with^ significant Xy^Sy 



[ 



The distributional data indrcate? that about 64-67% of these 420^ 




estimates were withifi +^ 1 standard deviation of their population parameters* 



and 87-97% were within +-2 standard deviations, This suggests th^t the 
estfimates for what one wouFd infer to be genuine cooperative suppression 
situations still tend to fall close to their parameters eVen though they 
are .subject to inflationary effects due to statistically significant negative' 
correlations between the independent variables, 

TABLE 7 

S/VMPLES WITH PATTERN OF COOPERATIVE SUPPRESSION AND 
SIGNIFICANT r'. MEANS, STANDARD DEVIATIONS, 
RANGES, MINIMA AND MAXIMA FOR REGRESSION 
•^X; ESTIMA.TES (N = 420) 

^Yl ^2 ^ 



2 



I. 



* • Mean , ^ .214 .351 .152 

Standard Deviation * .078 .091 .064 

Range .396 .466 .341 

Minimum .040 .110 .024 

Maximum ,436 .576 ' .365 

Out of all 10,000 samples, about 3% had a significant pattern of cooper- 
ative suppression with estimates within + 1 standard deviation of their 

population parameters and about 4%fwith estimates within +^ 2 standard deviations. 

"A 

Magnitudes of the Correlation 

Briefly, then,, the inflated betas in cooperative suppression patterns 
were larger than their correlation counterparts but still tended to be rela- 
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^tiyely good estimates of their own parameters. This could mean only that 
* thtS sample |yprrelations, r^^^ and v^^, in these patterns of cooperative 



suppression tended to underestimate their respective parameters. In the ' 
discussion below, different aspects of the data concerning sizes of the 
coronations show this to be true. 

Table 8 presents the means, standard deviations, ranges, minima and 
maxima for the correlations for the 4,763 patterns of cooperative suppression, 
The. mean for about 1 standard deviation below the population para-- 

meter, but that was expected since all selected cases had no positive ^^^2'^, 
The means for Vy^^ and r^^^ were somewhat lower than their parameters, 

TABLE 8 

ALL SAMPLES WITH PATTERN OF COOPERATIVE SUPPRESSION: 
MEANS, STANDARD DEVIATIONS, RANGES, MINIMA 
AND MAXIMA FOR CORRELATIONS (N = 4,763) 



12 



^Yl 



^Y2 



Mean 

Standard Deviation 

Range 

Minimum 

Maximum 



.077 
.058 
.340 
,340 
,000 



.181 
.086 
.475 
,000 
.475 



.331 
.088 
.604 
.028 
.632 



Although the minimum and maximum values attained for all correlations 
deviated markedly from the parameters, the frequency of such extreme values 
was low. Over the 4,763 cases, about 70% of the negative r^^^'^ were 1 stan- 
dard deviation below zero, 95% were 2 standard deviations below and 99.8% 
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were 3 standard deviations below. About 67-70% of the sample ^yi'^ ^Y2'^ 
fell within +^ 1 standard deviation of their parameters aT>d about 95-99% fell 
within +^ 2 standard deviations • 

IVhen cast in light of the 10,000 samples, those percentages become 
reduced by about half. For r^^ and rY2 about 32-^4% were within 1 standard 
deviation of their parameters and 45-57% within +^ 2 standard deviations. More 
of the ^Yl'^ ^X2^ that randomXy occurred together with the negative ^^^2'^ 
were below than above their par^eter%. Because of this slight discrepancy 
in distributional properties for correlations and betas and the consequent 
average underestimation by the correlation^, the amount of inflation due to 
the negative was, on the average, sufficient to make the averages of the 
sample betas accurate population approximations. 

Table 9 presents the cor:^lation means, standard deviations, ranges, 
minima and maxima for the 420 cases of cooperative suppression with significant 
^Yl'^ when using a one-tailed test and alpha = .05. The means for the correla- 
tions are even farther below their parameters than they are for all instances 
witlt the cooperative suppression pattern or those with the pattern and non- 
significant ^]^2'^- This distributional data shows that 60-63% of the ryj^'s 
and rY2's were within +^ 1 standard deviation of their population parameters 
and 91-98% within +^ 2 standard deviations. As before, these percentages suggest 
\he correlations tended to fall closely to thei^^ parameters, but they are 
soWwhat misleading by themselves because a disproportionate percentage of , 
of cases fall at the lower end of the distribution. Althoughuthis observation 
was noted above, it is more marked for these 420 instances. 

K 



TABLE 9 

SAMPLES WITH PATTERN OF COOPERATIVE SUPPRESSION AND SIGNIFICANT 
r MEANS, STANDARD DEVIATIONS, RANGES, MINIMA AND 
MAXIMA FOR CORRELATIONS Qi = 420) 

f 

/ ^12 ryj :^Y2 



Mean ' -.205 .142 .305 \ 

- Standard Deviation .035 .080 * . .092 

Range . ' .174 .393 .464 

Minimum -.340 .003 .070 

' Maximum -.166 .396 .534 

Nearly 3% of all 10,000 cases had a significant pattern of cooperative 
suppression with correlations within +_ 1 standard deviation of their para- 
meters and about 4% within j*^ 2 standard deviations. 

Sununary 

From a population in which cooperative suppression was absent and ry^ = 
• 2, "-^^ ^^^\'^i2 " ^* 10,000 random samples of sire 100 were generated for 
a 2 independent-1 dependent variable system and correlations and regression 
estimates calculated for each sample. Nearly 48% of the samples yielded the- 
pattern characteristic of cooperative suppresion, i,e. t^^ and ry^ positive 
in sign r^^^ negative. Only about 9% of these, however, were found to have 
a statistically significant negative correlation between independent variables 
for alpha = .05 (one-tailed tes't). This comprised about 4% of all 10,000 
samples and reflects the overall incidence in which one would incorrectly 
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infer that cooperative suppression exists in this particular population. 
The remaining 91% of the samples with patterns of cooperative suppression, 
or 43.3% of all 10,000, would not have allowed such an inference. 

Although the betas were larger than their counterpart correlations, the 
observation of Pyj^^s a'hd ^y2'^ grossly discrepant from the r^j^'s and '^y2^^ 

respectively was rare. Fi\rthermore, the betas did not, on the average, over- 

» 2 
estimate their parameters. On the contrary, the means for &yi* ^Y2 ^ 

quite accurately approximated their paralueters, and the majority of sample 
values for each tended to cluster close to their parameters. 

The means for r^^^ and r^^ were lower than their parameters and their 
distributions were som,ewhat weighted on the low end. Apparently, a dispro- 
portionate number of ^y^^'s and*rY2' falling below their parameters occurred 
in samples with aiegative "^12^^^ 

^ The averages for the correlations were not fi4ghly discrepant from their 
parameters nor were the distributions highly skewed* The largest discrepancies 
and unbalance in the distributions occurred in those samples of cooperative 
suppression with significant rj^2's* 

In general, then, there occurred a slight discrepancy in the distribu- 
tional properties for correlations and betas, the correlations being positively 
skewed, and an average underestimation of the parameters by the sample corre- 
lations. The magnitudes of inflation attributable to the randomly occurring 
negative ^^^2*^* though slight, were on the average sufficient to make the 
means for the sample betas accurate approximations of their parameters. 
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Implications for Path Analysis 

The original concern was over the possibility of a path analyst empiri- 
cally finding a pattern of cooperative suppression among some variables in 
the mddel although it did not exist among, them in the population. Tire related 
causal effects which the ^analyst would infer to be large would in fact have 
occurred as the result of inflationary effects attributable to the presence 
of a randomly occurring negative coixel at ion between the independent variables. 
Unknowingly, the researcher would conclude that these relationships in the 
model reflected causal influences much larger than actually existed. The 
results of this study do have somfe implications about such ppssibilities but 
tenta-^ively must be limited to causal models with two^^independent variable 
equations and a population in which variables are moderately related and 
cooperative suppression is nonexistent. * 

The possibility of finding the pattern is good but the, need for concern 
over grossly inflated estimates appears minimal. If the pattern occuri^d 
among a set or sets of variables in the model, the betas would not tend to be 
grossly exaggerated. This apparently is true regardless of the magnitude of 
the negative r,^ Although large estimates did appear as a function of the 
randomly oc^curring negative r^2* the possibility of obtaining estimates of 
such magnitudes was low. Indeed, the majority of values for the regression . 
estimates tended to congregate near their parameter values. Therefore, the 
data suggest that the path analyst need not be concerned over the inflationary 
effects on the bQtas and total proportion of variance explained should a pattern 
of cooperative suppression unexpectedly appear in the sample data. 



■ " \ • ■ . . . ■ 
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Implications for Future Regearch 

To' determine the generalizability of thQ results, two courses of action - 
should be taken, One is to perform a variety of analyses of this same type , 
using a sample size of 100 and' two independent^variables but with different 
positive population correlations between independent and dependent variables. 
Beyond ti^at, studies could also vary the magnitude of a negative population 
correlation between independent variables. One would 'then be investigating 
the -incidence of suppression and magnitudes of estimated that ensue when 
cooperative :suppression does exist ^in the population, - 

The second is to repeat all thos^ analyses with differing correlations 
vfor different sample sizes. In particular, smaller sajaple sizes would be 
more meaningful to educational research since researchers are often constrained 
by the necessity to use units of analysis of which they can obtain only a 
small number, such as classrooms and schools/ 
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APPENDIX A 



Population Values for Xy^y r^^ and t^^ 



It xs interesting and pertinent to path analysis to look at a popula- 
tion in which some degree of relationship between each independent variable 
and the dependent variable actually does exist* The use of path analysis 
presumably deals with a plausibly accurate model in which the relationships 
have been formulated on the basis of other empirical results in conjunction 
with theoretical substance. Chances are that some form of the hypothesized 
relationships do indeed exist in the population, 

. Other aspects of this study favored the use of population correlations^ 
for r^^ and t^^^ greater than zero. For N = 100 and alpha - .05 (one-tailed), 
the critical size for the correlation is .165; most r's generatedimder a 

population correlation which equal zero would be con/icJ^ably smaller than 

^ / 
this. The small sizes of a majority of the sanple t^j^ ' s and ^^^2^^ would have 

. been substantively uninteresting and produced regression estimates whose mag- 
nitudes would probably be ignored in a lot of path analytic studies. > Therefore, 
making the population parameters for r^^ and t^^^ gi^eater than zero was judged 
appropriate. ^ 

From a purely technical standpoint, sampling from a population in which 
these two congelations are positive and the correlation between the independent 
variables is zero increase's the chances of randomly obtaining patterns of 
cooperative suppression. Because the study intended to examine the incidence 
of cooperative suppression in samples when in fact it did not exist in the ^ 
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population, the population parameter for r^^ was not made negative. Two 
factors primarily determined the choice of parameter values for r^^^ and r^^^ 
In order to obtain, a majority of correlations in tKe neighborhood of those 
typically found in studies, it was decided not to ipake the degree\of relation- 
ship in the population too large. To make the study more interesting, the 
parameters for r^^ and wete made different from each otl^^^. With the ^ 
above considerations in mind, \hi5 investigation used arbitrarily selected 



^population parameters as follows: ^ ^* ^yj '^^^ 



( 
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A Mtonte Carlo Investigation of Spuriously Inflated Regression Estimates 
THO^^A.S D. JOVICK, Centqr for Educational Pblicy and Management 



This study used a Monte Carlo simulaticyfi to ascertain the degree of 
inflation that can occur in regression estimates when samples contain ran- . 
domly occurring instances of a pattern among correlations called cooperative.-, 
suppression.. Ten thousand samples of ''scores on three Variables v^e randomly 
drav^Ti froa a population in which the correlations among, the variables were 
prespecified such, that cooperative suppression did not exist. Cooperative 

suppression occurred in nearly- 48% of the samples but the incidence of 

g ■ * 

regression coefficients grossly discrepant from the population parameters^, 

was rare. Discussion centers around the implications for multiple linear 

■ — < ■ 

regression and a method o£ causal investigation called path analysis. 
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